Picture for Minda Hu

Minda Hu

Dynamic Mixture of Latent Memories for Self-Evolving Agents

Add code
May 21, 2026
Viaarxiv icon

PlanningBench: Generating Scalable and Verifiable Planning Data for Evaluating and Training Large Language Models

Add code
May 20, 2026
Viaarxiv icon

Saliency-R1: Enforcing Interpretable and Faithful Vision-language Reasoning via Saliency-map Alignment Reward

Add code
Apr 06, 2026
Viaarxiv icon

Search-R2: Enhancing Search-Integrated Reasoning via Actor-Refiner Collaboration

Add code
Feb 03, 2026
Viaarxiv icon

Probability-Entropy Calibration: An Elastic Indicator for Adaptive Fine-tuning

Add code
Feb 02, 2026
Viaarxiv icon

ConMax: Confidence-Maximizing Compression for Efficient Chain-of-Thought Reasoning

Add code
Jan 08, 2026
Viaarxiv icon

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

Add code
Jul 28, 2025
Figure 1 for A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
Figure 2 for A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
Figure 3 for A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
Figure 4 for A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
Viaarxiv icon

WebCoT: Enhancing Web Agent Reasoning by Reconstructing Chain-of-Thought in Reflection, Branching, and Rollback

Add code
May 26, 2025
Viaarxiv icon

A Survey of Personalized Large Language Models: Progress and Future Directions

Add code
Feb 17, 2025
Viaarxiv icon

NILE: Internal Consistency Alignment in Large Language Models

Add code
Dec 21, 2024
Viaarxiv icon